A Joint Domain-Specific Pre-Training Method Based on Data Enhancement

نویسندگان

چکیده

State-of-the-art performances for natural language processing tasks are achieved by supervised learning, specifically, fine-tuning pre-trained models such as BERT (Bidirectional Encoder Representation from Transformers). With increasingly accurate models, the size of fine-tuned pre-training corpus is becoming larger and larger. However, very few studies have explored selection corpus. Therefore, this paper proposes a data enhancement-based domain method. At first, task downstream jointly trained to alleviate catastrophic forgetting problem generated existing classical methods. Then, based on hard-to-classify texts identified tasks’ feedback, can be reconstructed selecting similar it. The learning deepen model’s understanding undeterminable text expressions, thus enhancing feature extraction ability texts. Without any pre-processing corpus, experiments conducted two tasks, named entity recognition (NER) classification (CLS). results show that selected proposed method supplement domain-specific information improve performance basic model achieve best compared with other benchmark

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Domain-specific enhancement of metacognitive ability following meditation training.

Contemplative mental practices aim to enable individuals to develop greater awareness of their own cognitive and affective states through repeated examination of first-person experience. Recent cross-sectional studies of long-term meditation practitioners suggest that the subjective reports of such individuals are better calibrated with objective indices; however, the impact of mental training ...

متن کامل

A HMM-based pre-training approach for sequential data

Much recent research highlighted the critical role of unsupervised pre-training to improve the performance of neural network models. However, extensions of those architectures to the temporal domain introduce additional issues, which often prevent to obtain good performance in a reasonable time. We propose a novel approach to pre-train sequential neural networks in which a simpler, approximate ...

متن کامل

Diagnosis of diabetes by using a data mining method based on native data

Background & Aim: Detecting the abnormal performance of diabetes and subsequently getting proper treatment can reduce the mortality associated with the disease. Also, timely diagnosis will result in irreversible complications for the patient. The aim of this study was to determine the status of diabetes mellitus using data mining techniques. Methods: This is an analytical study and its databas...

متن کامل

A Probabilistic Three-Phase Time Domain Electric Arc Furnace Model based on analytical method

An electric arc furnace (EAF) is known as nonlinear and time variant load that causes power quality (PQ) problems such as, current, voltage and current harmonics, voltage flicker, frequency changes in power system. One of the most important problems to study the EAF behavior is the choice of a suitable model for this load. Hence, in this paper, a probabilistic three-phase model is proposed base...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13074115